Exploratory model analysis with R and GGobi

نویسنده

  • Hadley Wickham
چکیده

Why do we build models? There are two basic reasons: explanation or prediction [Ripley, 2004]. Using large ensembles of models for prediction is commonplace, but is rarely used for explanation, where we typically choose one “best” model. When there are several equally good models, it is common sense to look at them too, but can the “bad” models tell us something as well? This paper describes exploratory model analysis for ensembles of linear models, where we look at all possible main effects models for a given dataset (or a large subset of these models). This gives greater insight than looking at any small set of best models alone: an ensemble of many models can tell us more about the underlying data than any individual model alone. This paper builds heavily on exploratory modelling analysis as introduced by Unwin et al. [2003], but rather than describing different types of useful plots it is organised around different levels of data. We will assume we have m models describing a data set with n observations and p variables. If all possible main effects models are fit, there will be 2p − 1 models in the ensemble. After fitting the models, we compute summary statistics on four levels:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Software Integration for Multivariate Exploratory Spatial Data Analysis

This paper describes a decade’s worth of evolution of integrating software to support exploratory spatial data analysis (ESDA) where there are multiple measured attributes. The multivariate graphics tools we use are XGobi, and more recently, GGobi. The paper is divided into two parts. In the first part, we review early experiments in software linking for ESDA, which used XGobi, different Geogra...

متن کامل

GGobi: XGobi Redesigned and Extended

GGobi is a direct descendant of XGobi, with multiple plotting windows, a color lookup table manager, an XML (Extended Markup Language) file format for data, and other changes. Perhaps the biggest change is that GGobi can be embedded in other software and controlled using an API (Application Programming Interface). This design has been developed and tested in partnership with R. When GGobi is us...

متن کامل

GGobi meets R: an extensible environment for interactive dynamic data visualization

GGobi is a direct descendant of XGobi, designed so that it can be embedded in other software and controlled using an API (application programming interface). This design has been developed and tested in partnership with R. When GGobi is used with R, the result is a full marriage between GGobi’s direct manipulation graphical environment and R’s familiar extensible environment for statistical dat...

متن کامل

Exploratory Visual Analysis of Graphs in GGobi

Graphs have long been of interest in telecommunications and social network analysis, and they are now receiving increasing attention from statisticians working in other areas, particularly in biostatistics. Most of the visualization software available for working with graphs has come from outside statistics and has not included the kind of interaction that statisticians have come to expect. At ...

متن کامل

Exploring cluster analysis

This paper presents a set of tools to explore the results of cluster analysis. We use R to cluster the data, and explore it with textual summaries and static graphics. Using Rggobi2 we have linked R to GGobi so that we can use the dynamic and interactive graphics capabilities of GGobi. We then use these tools to investigate clustering results from the three major families of clustering algorithms.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006